NSF PAR Search | NSF Public Access Repository

Constant Stepsize Local GD for Logistic Regression: Acceleration by Instability

Crawshaw, Michael; Woodworth, Blake; Liu, Mingrui (July 2025, International Conference on Machine Learning)

Free, publicly-accessible full text available July 23, 2026
Local Steps Speed Up Local GD for Heterogeneous Distributed Logistic Regression

Crawshaw, Michael; Woodworth, Blake; Liu, Mingrui (March 2025, International Conference on Learning Representations)

Free, publicly-accessible full text available March 1, 2026
Lower bounds for non-convex stochastic optimization

https://doi.org/10.1007/s10107-022-01822-7

Arjevani, Yossi; Carmon, Yair; Duchi, John C.; Foster, Dylan J.; Srebro, Nathan; Woodworth, Blake (January 2022, Mathematical Programming)

Full Text Available
An Even More Optimal Stochastic Optimization Algorithm: Minibatching and Interpolation Learning

Woodworth, Blake E.; Srebro, Nathan (January 2021, Advances in Neural Information Processing Systems)

Full Text Available
Mirrorless Mirror Descent: A Natural Derivation of Mirror Descent

Gunasekar, Suriya; Woodworth, Blake; Srebro, Nathan (January 2021, Proceedings of Machine Learning Research)
We present a direct (primal only) derivation of Mirror Descent as a “partial” discretization of gradient flow on a Riemannian manifold where the metric tensor is the Hessian of the Mirror Descent potential function. We contrast this discretization to Natural Gradient Descent, which is obtained by a “full” forward Euler discretization. This view helps shed light on the relationship between the methods and allows generalizing Mirror Descent to any Riemannian geometry in R^d, even when the metric tensor is not a Hessian, and thus there is no “dual.”
Full Text Available
On the Implicit Bias of Initialization Shape: Beyond Infinitesimal Mirror Descent

Azulay, Shahar; Moroshko, Edward; Nacson, MS; Woodworth, Blake; Srebro, Nathan; Globerson, Amir; Soudry, Daniel (July 2021, Proceedings of Machine Learning Research)
Recent work has highlighted the role of initialization scale in determining the structure of the solutions that gradient methods converge to. In particular, it was shown that large initialization leads to the neural tangent kernel regime solution, whereas small initialization leads to so-called “rich regimes”. However, the initialization structure is richer than the overall scale alone and involves relative magnitudes of different weights and layers in the network. Here we show that these relative scales, which we refer to as initialization shape, play an important role in determining the learned model. We develop a novel technique for deriving the inductive bias of gradient flow and use it to obtain closed-form implicit regularizers for multiple cases of interest.
Full Text Available
The min-max complexity of distributed stochastic convex optimization with intermittent communication

Woodworth, Blake E.; Bullins, Brian; Shamir, Ohad; Srebro, Nathan (January 2021, Conference on Learning Theory)

Full Text Available
A Stochastic Newton Algorithm for Distributed Convex Optimization

Bullins, Brian; Patel, Kumar K.; Shamir, Ohad; Srebro, Nathan; Woodworth, Blake (January 2021, Advances in Neural Information Processing Systems)

Full Text Available
Minibatch vs Local SGD for Heterogeneous Distributed Learning

Woodworth, Blake; Patel, Kumar Kshitij; Srebro, Nathan (June 2020, Advances in Neural Information Processing Systems 33 (NeurIPS 2020))
Full Text Available
Guaranteed Validity for Empirical Approaches to Adaptive Data Analysis

Rogers, Ryan; Roth, Aaron; Smith, Adam; Srebro, Nathan; Thakkar, Om; Woodworth, Blake (August 2020, Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics)
Full Text Available
